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METHOD AND SYSTEM FOR PROVIDING A HANDS-FREE FUNCTIONALITY ON 
MOBILE TELECOMMUNICATION TERMINALS BY THE TEMPORARY 
DOWNLOADING OF A SPEECH-PROCESSING ALGORITHM 

Description 

[0001] The present invention relates to a method for carrying out a hands-free 
co m mun i cation using a telecommunication terminal, especially a mobile telecommunication 
terminal, and to a system for providing such a hands-free communication, and to devices 
suitably adapted for use within such a system. 

[0002] The prior art describes voice services that can be called using a telephone and which 
have server-based speech recognition (Automatic Speech Recognition, ASR) implemented 
therein. A dialog system connected to the telephone networlc allows communication between 
these services and a user, the aforementioned speech recognition forming a technical basis for 
this commimication. 

[0003] Such a server-based speech recognition generally includes programs for 
implementing algorithms for processing digitized speech data, and thus for recognizing 
spoken utterances of the user. Usually, in order to improve the recognition, echo cancellation 
and noise reduction methods are carried out in a preprocessing stage of the speech recognition 
on the respective server system connected to the telephone network. 

[0004] Moreover, first attempts have been made to implement similar speech recognition 
systems and corresponding preprocessing algorithms on telecommimication terminals, such as 
a personal digital assistant (PDA), or a multimedia digital assistant (MDA). However, since in 
such terminals, the memory capacity for the software to be permanently installed is generally 
insufficient to provide comprehensive functionality, the preprocessing algorithms used do not 
reach the standard of the server-based speech recognition solutions, especially in terms of 
quality, and, in addition, a much smaller vocabulary is used. 

[0005] A fiirther approach for speech recognition is based on Distributed Speech 
Recognition (DSR), which is described in the literature. Here, the preprocessing is done in the 
telecommunication terminal connected to the telephone network, that is, for example, in a 
mobile PDA, MDA, or the like. In the process, feature vectors resulting from the 
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preprocessing are subsequently transmitted at a reduced data rate over the telephone network 
to a server, where they are fed to subsequent processing stages of a speech recognition unit. 
However, this technology, which requires the definition of new interfaces in the transmission 
network, is still under development and will probably not come to fruition xmtil in a few 
years, provided that reduced data rates then still play an important role in the transmission of 
speech data. 

[0006] Fxuthermore, the legislatures of various coimtries have ruled that hands-free 
telephone systems must be used when telecommunication terminals, such as the 
aforementioned MDA or PDA, or a telephone, including a cordless or mobile telephone, are 
used in a moving vehicle, for example, for the purpose of using voice services. 

[0007] Such hands-free telephone systems generally include a so-called level discriminator 
to prevent feedback between the microphone and the loudspeaker. When extraneous noise is 
present, these level discriminators may cause fluctuations in the volume level, which is not of 
much consequence for interhuman communication, but which, in the case of speech 
recognition, extremely reduces the speech recognition rates of the respective voice services. 
As a result, in particular, such voice services can no longer be used, or xxsed only to a limited 
extent. 

[0008] Unlike in mobile applications, so-called hands-free boxes exist for stationary 
applications in the fixed network. In these hands-free boxes, digital hands-free algorithms are 
implemented on a hardware module, said hands-free algorithms overcoming the 
disadvantages of the level discriminators and allowing improved use, in particular, of voice- 
operated services. 

[0009] It is an object of the present invention to provide a method which is new and 
significantly improved over the aforementioned prior art, and by which an extremely flexible 
hands-free ftmctionality may be provided for telecommunication terminals in general, but in 
particular for the aforementioned mobile telecommimication terminals, which generally have 
only a very limited memory capacity. 

[0010] Most surprisingly, the object of the present invention is already achieved by the 
respective subject matters and features set forth in the independent claims appended hereto. 
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[0011] Advantageous and/or preferred embodiments and refinements are the subject matter 
of the respective dependent claims appended hereto. 

[0012] Thus, the present invention proposes a method for carrying out a hands-free 
co m mun i cation using a telecommunication terminal, especially a mobile telecommunication 
terminal, where at least one program for implementing a speech processing algorithm, 
especially a hands-free algorithm, is temporarily or permanently loaded from a service server 
into the communication terminal and implemented for use, at least for the duration of a 
communication connection. 

[0013] Thus, a particular important advantage is that, due to the only at least temporarily 
loaded algorithm, speech processing ftmctionality is enabled in particular also for hands-free 
talking with telecommunication terminals such as a PDA, MDA or a mobile telephone, which 
have no or only very little memory capacity, and especially ROM capacity, and also that, 
similar to human-to-human commimication, the speech signals may be transmitted during the 
telecommunications connection. 

[0014] Consequently, a voice service, for example one based on a server-based speech 
recognition, such as ASR, can already be used under hands-free conditions using the existing 
interfaces of existing telecommunications networlcs, that is, without the need to additionally 
agree on, or standardize, new or additional interfaces, as is the case, for example, in 
distributed speech recognition DSR. 

[0015] In order to improve the quality and/or to verify transmitted speech signals, especially 
for subsequent speech recognition, a preferred refinement of the present invention provides 
for the loading to include the loading of at least one echo cancellation and/or noise reduction 
algorithm from the service server. Furthermore, if, in addition or alternatively, at least one 
voice and/or speaker verification, recognition, and/or classification algorithm is loadable from 
the service server, then this allows a user and/or a voice to be verified in an application- 
specific manner, for example, as being registered with a service, to be identified, for example, 
from a group of individuals, and/or to be classified as male or female. In a further 
advantageous embodiment, it is possible to load a program for implementing a text-to-speech 
algorithm, that is, for automatic conversion of text into speech. 
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[0016] The speech signals to be transmitted are preferably digitized for transmission, in 
which process the speech signals may additionally be encoded, depending on the 
telecommunication terminal used, for example, based on a terminal operating according to the 
GSM standard. Preferred embodiments of suitably adapted devices therefore include A/D 
and/or D/A converters, and are designed in a system-application specific manner for using, in 
particular, digital algorithms. 

[0017] Based on the, possibly, temporary loading of at least one algorithm fi-om the service 
server, on which, advantageously, a plurality of algorithms are stored for temporary loading, 
provision is made for said server to be located such that it is centrally accessible via at least 
one communication network in order to further increase flexibility, especially with respect to 
provisioning and access capacities. Thus, connections between one or a plurality of 
telecommunication terminals and the service server may easily be established substantially 
independently of location over the at least one communication network, which may be a radio 
communication network, a fixed network and/or the Internet. 

[0018] In accordance with a first preferred embodiment, such a connection may be 
established directly between the service server and a particular telecommunication terminal. 
Preferably, such a connection for loading at least one algorithm or the program for 
implementing an algorithm is established in response to an automatic or user-defined request 
signal by the telecommimication terminal. 

[0019] Furthermore, in particularly preferred embodiments of the present invention, a 
connection between the telecommunication terminal and a server-based speech recognition 
system is established over at least one communication network. 

[0020] Especially in such embodiments, it is additionally or altematively provided that the 
connection between the service server and the telecommimication terminal for, possibly, 
temporarily loading at least one algorithm is established in response to a request signal of the 
server-based speech recognition system. 

[0021] To allow extremely flexible use, the method of the present invention provides that 
the connection between the telecommimication terminal and the at least one communication 
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network be by wire or wireless, in accordance with the specific application. Thus, the present 
invention makes it possible to connect substantially any telecommunication terminal, and to 
carry out the inventive method using essentially any communication network, especially a 
mobile telecommunications network, for example one based on GSM (Global System for 
Mobile Commxmication) or UMTS (Universal Mobile Telecommunication System), a 
(W)LAN network ((Wireless) Local Area Network) and/or a fixed network, as is the case, for 
example, when the telecommimication terminal used is a DECT (Digital Enhanced Cordless 
Telecommunication) telephone. ^ 

[0022] The inventive arrangement of a server-based speech recognition system and/or the 
service server can also be implemented in an extremely flexible and application-specific 
manner. In particular, it is preferred for the server systems to integrated directly into a radio 
co mm unication network or a fixed network. Here, an intelligent network may be included, so 
that the server system or systems is/are disposed, for example, in a service switching point 
and have access to an intelligent periphery. In a complementary or alternative embodiment, 
provision is also made for the server systems to be configured to have connections to the 
Internet using WEB servers, which are essentially computers and/or software which, in a 
network, provide HTTP (Hypertext Transfer Protocol) and access to the Internet. In this case, 
the telecommunication terminals contain interface devices for providing commimication 
connections over the Internet. 

[0023] Thus, using the present invention, a connection between the telecommxmication 
terminal and the service server and/or the server-based speech recognition system and/or 
between the speech recognition system and the service server can particularly advantageously 
be established by setting up a call using respectively assigned identifiers. Consequently, in a 
preferred practical embodiment, the present invention allows the use of a plurality of such 
identifiers, which, in particular, differ according to the specific application, depending on the 
telecommunications networks, servers and/or telecommunication terminals used. Such 
identifiers may include, for example, subscriber numbers and/or service numbers, IP 
addresses, calling line identifiers (CLI - Calling Line Identification; ANI - Automatic Number 
Identification) and/or identification addresses which are assigned to mobile telephones and 
stored in a Home Location Register (HLR) of a respectively associated commimication 
network. 
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[0024] In another advantageous refinement, provision is also made for the 
telecommunication terminal to be configured for multi-channel signal processing. Thus, it 
may also be ensured that, for example, when connecting a plurality of microphones via a 
respective audio and/or stereo input, the quality of especially a noise reduction can be fiirther 
improved significantly by locating the speech source, which will then, in principle, be 
possible. Multi-channel processing can also be carried out on the server, which then requires 
multi-channel or virtually multi-channel (multiplex) transmission between the server and the 
terminal. 

[0025] If the telecommvmication terminal has at least two microphone channels, such as a 
stereo channel, then a hands-fi-ee algorithm which performs multi-channel processing, 
especially for locating the speech source for improved noise reduction, can advantageously be 
loaded into the telecommimication terminal. If the telecommunication terminal additionally 
includes at least two loudspeaker channels and if the signal transmission is multi-channel or 
virtually multi-channel (multiplex), then a stereo or hands-fi-ee algorithm and/or stereo or 
multi-channel echo cancellation, especially for hands-firee transmission including three- 
dimensional perception, can preferably be loaded into the telecommunication terminal. 

[0026] Multi-channel transmission further has the advantage that, for example, in addition to 
speech data, further specific parameters, vector data, test and/or compensation signals, can be 
easily transmitted, which would otherwise have to be transmitted embedded in the mono 
signal together with the speech data, as required. 

[0027] In particular, using such test and/or compensation signals, the algorithm used can be 
tested, at substantially all times, with respect to the respective current environment conditions. 
For this purpose, preferably, a comparator unit is provided by which a test signal that is output 
by a loudspeaker on the side of the telecommunication terminal is compared to the signal 
receivable via a microphone of the teleconmiunication terminal. 

[0028] Depending on the specific application, such a test is performed in response to a test 
signal which is transmitted by the server-based speech recognition system and/or the service 
server, or which is generated by the telecommunication terminal. The present invention 
further includes embodiments in which the actual comparison check of the two signals is 
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performed immediately in the telecommunication terminal, or after the received signal is 
transmitted back to one of the server-based systems. 

[0029] Thus, the updating of an algorithm, i.e., the adjustment, adaptation, or replacement of 
the at least one used algorithm according to the respective current environment, is performed 
in response to the test result, for example, by loading an appropriate program from the service 
server or, if a plurality of algorithms are, at least temporarily, loaded on the 
teleconununication terminal, by the telecommunication terminal autonomously selecting the 
appropriate algorithm. 

[0030] In order to further increase the quality of the speech recognition and the flexibility in 
the use of different frequency spectra and/or bands, the present invention preferably also 
provides a functionality for converting the speech signals for transmission between 
communication units operating at different frequencies, for example, from a 
telecommunication terminal processing speech signals on a 30 kHz basis for a communication 
connection provided on an 8 kHz basis by a commimication network used, which may be 
followed by re-conversion to 30 kHz via a conversion unit respectively associated with the 
server-based speech recognition. 

[0031] According to a further advantageous refinement of the present invention, it is further 
proposed that specific identification parameters and/or charging parameters be transmitted by 
the telecommunication termined for further processing and recorded by a device associated 
with the speech recognition system and/or with the service server. 

[0032] Thus, especially if, on the service server and/or the server of the speech recognition 
system, such identification parameters are associated with algorithms that are particularly 
suited for the respective telecommunication terminal, then, based on parameters transmitted in 
this manner, suitable algorithms to be temporarily loaded can be advantageously preselected 
in a time-saving manner already when a telecommunication terminal logs on again and/or 
repeatedly to one of the server systems. 

[0033] Furthermore, using application-specific charging parameters, possibly in conjunction 
with the identification parameters, a preferably automatic invoicing and/or debiting process 
associated with the telecommunication terminal and/or the user of the telecommunication 
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terminal for invoicing and/or debiting chargeable services and/or algorithms can be 
implemented very easily with essentially all invoicing and/or debiting methods generally 
known for this purpose. 

[0034] Moreover, in a practical embodiment of the present invention, an analog-to-digital 
conversion and/or a digital-to-analog conversion to be performed on the side of the 
telecommunication terminal is calibrated before or during the use of a temporarily 
implemented algorithm. Such caUbration can be performed once for a communication 
connection, or continuously. In particular, for flexible, environment-based use or selection of 
suitably adapted algorithms of a common group, or of a common provider, it is also 
advantageous for a calibration to be performed digitally, especially using a processor which 
forms part of the telecommunication terminal and executes a particular algorithm. 

[0035] Moreover, it is provided that compensation signal used for the thus substantially 
universally usable calibration is the speech signal itself and/or suitably designed test signals, 
such as a noise signal emitted during speech pauses via the loudspeaker of the 
telecommunication terminal and received back via the microphone of the telecommunication 
terminal. 

[0036] Thus, the present invention further includes a system, in particular as set forth in the 
appended claims, which is suitably designed for carrying out the inventive method and which, 
in the particiilar embodiments thereof, has advantages identical or comparable to those 
mentioned above. 

[0037] Further advantages and characteristics of the present invention will be apparent from 
the following description of preferred, but only exemplary embodiments of the invention with 
reference to the accompanying drawings. 

[0038] In the drawings, 

[0039] Figure 1 is a greatly simpUfied schematic representation of a system according to the 
present invention; and 



8 



520.1061 



[0040] Figure 2 is a simplified block diagram for illustrating a local processing principle for 
the inventive hands-fi*ee functionality on a mobile telecommunication terminal according to 
the present invention. 

[0041] In the following, preferred embodiments of the present invention will be described 
by way of example with reference to Figures 1 and 2, which show a schematic representation 
of a hands-free system according to the present invention and a block diagram for illustrating 
a local processing principle according to the present invention for using a h£uids-fi:ee 
functionality on a mobile telecommimication terminal. 

[0042] Figure 1 shows a mobile telecommunication terminal 100 which is able to access a 
telecomm\mications network 200 via an air interface, for example, by radio, as indicated by 
double arrow 1. The double arrow further indicates that a duplex commimication, 
conveniently a full-duplex communication, is provided via the air interface. In the case under 
discussion, telecommunication terminal 100 is a mobile telephone, a PDA, or an MDA, 
which, on the basis of a GSM standard, is able to communicate over a mobile 
telecommunications network which, in the present case, is included in telecommunications 
network 200, and thus, to transmit speech data over network 200 in a manner corresponding 
to hmnan-to-himian conmiimication. 

[0043] However, it should be pointed out that the mobile telecommunications network and 
the telecommimication terminal 100 associated therewith may also be based on a different 
standard, such as a UMTS Standard. Moreover, it should be noted that the term 
"telecommimications network" used in the following description and the appended claims 
should be understood to generally refer to only one commimications network or to a plurality 
of communications networks, including voice/data networks and data/data networks. 

[0044] Via at least one further interface, which is indicated by double arrow 2, a voice- 
operated CT server (computer telephony server) 300 having speech recognition algorithms is 
connected permanently or as needed to telecommunications network 200, which is suitable 
for transmission of speech data, said connection being established directly via the mobile 
telecommimications network or via further communication networks not shown. 
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[0045] Furthermore, a connection 3 is established permanently or as needed to a service 
server 400 containing a plurality of digital hands-free algorithms and, possibly, further audio 
signal preprocessing algorithms, such as, in particular, echo cancellation and/or noise 
reduction algorithms. 

[0046] The illustrated system configuration further includes a third server 500, which is part 
of a charging or charge invoicing and debiting system, which is essentially a so-called billing 
system or billing support system (BSS). In the case under discussion, a simplex connection 4 
can be established to said third server via telecommunications network 200. 

[0047] Servers 300, 400 and 500 preferably include direct connections 5, 6 allowing them to 
communicate and/or exchange data between each other, so that, in an alternative embodiment, 
only connection 2 from servers 300, 400 and 500 to telecommunications network 200 is 
needed for carrying out the inventive method, which will be described in detail below. In 
another alternative embodiment, servers 300, 400 and 500 may be part of a shared server 
system. 

[0048] When using the illustrated system configuration in accordance with a preferred 
embodiment in which servers 300, 400 and 500 are in the form of, for example, a WEB 
server, a service server 400 provides in each case at least one program for implementing a 
hands-free algorithm for mobile telecommunication terminal 100, said at least one program 
being able to be loaded from the Internet via telecommunications network 200. This at least 
one program is temporarily loaded into mobile telecommunication terminal 100 and 
implemented thereon for using a voice service provided by server 300. Since, generally, a 
main memory is already sufficient for temporary loading, mobile telecommunication terminal 
100 needs essentially no hard disk storage capacity in this case. However, in specific 
applications, it is still possible to use such storage capacity. 

[0049] Depending on the environment condition currently present, for example, when using 
mobile telecommunication terminal 100 in a particular vehicle which represents a different 
noise environnaent than when using mobile teleconmiunication terminal 100 in the open air or 
in a differently designed vehicle, a respectively suitable algorithm of the plurality of 
algorithms provided on a mass stomge unit in service server 400 can thus in each case be 
temporarily loaded into teleconmiunication terminal 100 and implemented thereon, even in 
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the case of extremely limited memory capacity. After the memory capacity has been used for 
a specific purpose, it is made available for other applications again. 

[0050] Depending on the respective, especially application-specific design, the at least one 
algorithm is transmitted, for example, when server 300 and or server 400 is/are called for the 
first time based on a particular service subscribed, or when directly requested by the user of 
mobile telecommunication terminal 100. 

[0051] Thus, unlike in the case of distributed speech recognition, i.e. unlike in DSR, no 
feature vectors are required during the subsequent conmiunication between mobile 
telecommunication terminal 100 and speech recognition system 200 and/or when establishing 
a communication connection to another telecommunication terminal. Rather, regular speech 
data is transmitted, which, in the example under discussion, is GSM-coded speech data. 

[0052] As can be seen from Figure 2, a preferred embodiment provides for the mobile 
telecommunication terminal 100 to include a transmitting/receiving imit 101, an encoder unit 
102, and a processor unit 103 which is connected to the temporary memory and is able to 
execute an algorithm temporarily loaded into the memory. Processor unit 103 is connected to 
a digital-to-analog converter 105, which is connected to an internal loudspeaker 108, or which 
may additionally or alternatively be connected to an external loudspeaker 110, for example, 
via an infrared or Bluetooth interface, or via a wired interface. A connection from an internal 
microphone 107 to processor unit 103, or, analogously, from an external microphone 109 to 
said processor imit via an interface, is provided via an interposed analog-to-digital converter 
104. Furthermore, a calibration control unit 106, which is capable of being controlled by 
processor imit 103, is provided for calibrating converters 105 and 104. Advantageously, 
converters 104 and 105, or an associated unit, additionally provide signal amplification, 
especially an adjustable signal amplification. 

[0053] Depending on the specific design, a calibration of converters 104 and 105 is 
performed once each time teleconmiunication terminal 100 is placed into service, or during 
operation, for example, continuously or monitored in a time-based manner. 

[0054] It is also possible to perform a digital calibration, for example, based on the signal 
which is present at processor imit 103 and which is fed to converter 105 or received by 
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converter 104. Such calibration is preferably specifically matched to a particular group of 
temporarily loadable algorithms, especially using a suitable assignment and/or combination 
scheme or schemes. 

[0055] Thus, in accordance with the embodiment of Figure 2, before digital speech signals 
which are transmitted from speech recognition system server 300 over telecommunications 
network 200 to mobile telecommunication terminal 100 are output on loudspeaker 108 or 110, 
they are suppUed to the hands-free algorithm, which is activated by processor unit 103, in 
digitized form for fiirther processing and subsequently fed to loudspeaker 108 and/or 
loudspeaker 1 10 via digital-to-analog converter 105. 

[0056] Analogously, a speech signal received by microphone 107 and/or microphone 109 is 
digital-to-analog converted by converter 104, possibly amplified in a suitably adapted 
manner, and subsequently fed to processor vmit 103 and processed by the activated hands-free 
algorithm before it is transmitted over telecommunications network 200. 

[0057] Thus, without the need to additionally agree on, or standardize, new interfaces, as 
would otherwise be the case with DSR, the present invention allows voice services to be used 
under hands-free conditions already imder the present conditions, using the existing 
interfaces, and thus, to be used in particular also inside a vehicle. 

[0058] Since, as mentioned above, additional algorithms are preferably provided on service 
server 400 for at least temporary loading, a very particularly preferred embodiment provides 
that, in addition to the at least one hands-free algorithm, for example, noise reduction 
algorithms are also temporarily loaded into telecommunication terminal 100 in an analogous 
manner in order to be executed by processor unit 103. 

[0059] If, for example, the latter algorithms are executed on mobile telecommimication 
terminal 100 in a vehicle, and if the mobile telecommunication terminal allows connection of 
several microphones, for example, via a stereo input, then it is also possible to significantly 
further improve the noise reduction quality by locating the speech source, i.e., by locating the 
speaker or the user of mobile telecommunication terminal 100, which will then, in principle, 
be possible. However, when executing a noise reduction algorithm directly on speech 
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recognition system server 300, there is generally only a mono signal available, which does 
allow noise reduction, but generally does not allow location finding. 

[0060] Telecommunication terminal 100, server 300 and/or service server 400 preferably 
transmit charging and/or identification parameters to charging server 500 for the period of use 
of the speech recognition service provided via server 300 and/or for the use of an algorithm of 
service server 400, said parameters allowing the service to be invoiced. The invoicing and/or 
debiting of accounts can be done using essentially all methods that are knovra or which will 
be developed in the future. 

[0061] The algorithm or algorithms carried out by processor imit 103 is/are checked for 
suitability imder the current conditions, preferably using a compensation signal which is, for 
example, packed into a noise signal and output via loudspeaker 108 or 1 10 during speech 
pauses, received as a response signal via microphone 107 and/or microphone 109, and 
compared to the signal that was output. 

[0062] When providing a suitable signal generator (not shown), such a test or compensation 
signal may be independently generated by the mobile telecommunication terminal, especially 
when mobile telecommunication terminal 100 has loaded thereon a plurality of algorithms 
capable of being selected for activation. However, such test or compensation signals can also 
be transmitted by server 300 and/or server 400 to mobile telecommimication terminal 100 
and, upon receipt of the response signal, be compared to said response signal on the server or 
on a respectively associated test unit as to whether the currently activated algorithm is 
suitable, so that, if necessary, a suitably adapted and updated algorithm is transmitted firom 
service server 400 to mobile telecommunication terminal 100 and temporarily loaded thereon. 

[0063] If mobile telecommunication terminal 100 has a single-channel design, such a 
compensation or test signal is preferably embedded into the speech signal as a noise signal. In 
the case of a mobile telecommunication terminal 100 having two channels, said compensation 
or test signal may be transmitted over the additional channel. 

[0064] The present invention fiirther provides that, in the case of a two-channel mobile 
telecommunication terminal 100, additional parameters, such as the aforementioned 
identification parameters, further data and/or possibly also feature vectors be transmitted over 
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the additional channel, i.e. essentially independently of the speech data, but possibly 
depending on the particular algorithm used, provided that the telecommvmications network 
200 used and/or interfaces 1 and 2 are suitably designed for this purpose. 

[0065] The present invention further includes embodiments in which interfaces 1 and 2 have 
frequency bands that are different from the frequency band of mobile teleconmiunication 
terminal 100. For example, if the signal processing performed by telecommimication terminal 
100 is based on a 30kHz band, telecommimication terminal 100 preferably contains a 
conversion unit to convert the 30kHz speech signal to, for example, an 8kHz speech signal for 
transmission to voice-operated CT server 300. A conversion unit associated with CT server 
300 possibly reconverts the so-received signals to the original 30kHz signal in accordance 
with the specific application prior to speech recognition. Such signals, which may have to be 
converted, are identified using, for example, the additionally transmitted data or parameters 
mentioned above. 

[0066] The present invention further includes embodiments in which algorithms to be 
transmitted are preselected based on identification parameters which specify 
telecommunication terminal 100 and which are transmitted and/or requested when speech 
recognition server 300 is called by telecommimication terminal 100. Such preselected 
algorithms can be preset for the specified teleconmiunication terminal 100, or may, in the 
past, have turned out to be suitable algorithms, for example based on an environment 
condition detected in the past with respect to teleconmiunication terminal 100. Subsequently, 
service server 400 is accordingly requested, for example via connection 5, to transmit the 
selected or preset algorithm. However, preselection is also possible in an analogous maimer 
by-passing speech recognition server 300, that is, via interfaces 1 and 3 according to Figure 1. 

[0067] Such identification parameters may vary in accordance with the specific application 
and, depending on the telecommimication terminal used, may include an IP address, a CLI 
and/or a parameter extracted by server 300 from an HLR associated with telecommunication 
terminal 100. 

[0068] Moreover, it is not essential to the present invention that telecommunication terminal 
100 be a mobile telecommimication terminal. In principle, the telecommunication terminal 
used in the present invention can also be a stationary telecommunication terminal, or a 
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telecommunication terminal that is permanently incorporated in a vehicle, and, depending on 
the underlying system, can also be designed to include a DECT, Bluetooth, (W)LAN, or other 
interface, including also a wired interface, for access to the corresponding network. 

[0069] The entire telecommunications network 200 used may therefore vary in accordance 
with the specific apphcation and may include, for example, mobile telecommunications 
networks, (W)LAN, fixed networks and/or the Internet 

[0070] Moreover, the telecommunications network used may include an intelligent network, 
in which case at least speech recognition system server 300 is preferably disposed in a service 
node and advantageously has access to an intelligent periphery. 

[0071] In further preferred embodiments, service server 400 may also be designed directly 
with telecommunication terminal 100 to provide algorithms, while by-passing 
telecommimications network 200. Thus, for example, it is proposed in particular that service 
server 400 be part of an intelligent unit which is accommodated, for example, in a vehicle and 
on which a plurality of algorithms are available and that it be appropriately supplied with 
current algorithms over the telecommimications network from a central server unit, not shown 
in Figure 1. Thus, a respectively suitable algorithm may be temporarily loaded from such a 
disposed service server into telecommunication terminal 100 also via a direct connection to 
telecommimication terminal 100. 

[0072] Using call identifiers respectively assigned to the individual system components 100, 
300, 400 and/or possibly 500 in an arrangement selected in accordance with the specific 
application, or in a given arrangement, it is thus possible to establish respective desired or 
necessary conmiunication coimections in a location-independent maimer between the 
individiml devices and/or systems for carrying out the method of the present invention. 

[0073] Thus, such identifiers include in particular subscriber nimibers and/or service 
numbers, IP addresses, calling line identifiers and/or identification addresses which are 
assigned to mobile telephones and stored in a home location register of a respectively 
associated commimication network. 
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[0074] Thus, using the present invention, it is no longer necessary for a speech processing 
functionality, especially a hands-free and/or noise reduction or speech recognition 
functionality, to be permanently installed on a telecommunication terminal 100, so that the 
invention may be used especially in telecommunication terminals which have no or only very 
little memory, or which have insufficient capacity available in this memory, or if this capacity 
is intended to be used for other purposes. 

[0075] Thus, for example, telecommunication terminal ICQ may be pre-configured such that, 
when establishing a communication connection, it first automatically establishes a connection 
to service server 400 for temporarily loading one or possibly a plurality of algorithms into, 
and implementing the same on, telecommunication terminal 100, and that telecommunication 
terminal 100 may then appropriately select the respectively suitable algorithm. 

[0076] If an implemented algorithm turns out to be no longer suitable and/or when the 
desired communication connection is terminated, the memory capacity, for example of the 
main memory, that is occupied by the algorithm is freed for other applications. 
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